Data Science and Statistical Learning Course Final Project.
Figure 1: Due to Comaptility issue with GitHub Pages, please go to https://samuellee19.shinyapps.io/SFQ_Final_Project/ to use interactive dashboard.
The Stop, Question and Frisk program is a practice, utilized by New York City Police Department (NYPD), of temporarily halting, questioning, and, in certain cases, searching civilians on the street for weapons and other dangers. It is also called “Terry Stop,” named after the Supreme Court case Terry v. Ohio (1968).
The use of Stop, Question and Frisk practice is often endorsed with the Broken Windows Theory, suggesting that even low-level crimes and civil disorder leads to more serious crimes in urban enviornments. In fact, after NYPD officer Adrian Schoolcraft made extensive recordings on the department’s Stop and Frisk policy, numerous civil rights organizations, such as NYCLU, raised a concern that the program unfairly targest certain minorities, such as African-Americans and Hispanic-Americans. Similarly, the Guardian reporter also noted that “since 2002 (the Kelly/Bloomberg years), over 35% of officers feel high pressure to write stop-and-frisk reports…felt less pressire to obey consitutional constraints.”(Eterno and Silverman 2013)
A major turning point was the 2013 court case Floyd v. City of New York and a subsequent NYPD mandate that requires officers to thoroughly justify the reason for making a stop. After Bloomberg’s mayorship ended, Bill de Blasio took over the office and pledged for a major reform on NYPD policies.
For this project, we will be using New York City data on Stop and Frisk practice, from 2003 to 2018. Due to formatting changes happened circa 2017, data from 2017-2018 period will be limited to visualization process. Hence, we will determine changes from 2003-2007, 2007-2012, and 2013-2016 periods.
For this project, we mostly used library(mapdeck) to create screengrid and hexagon objects to show density of Stop and Frisk events across our timeline, as you can see from the above dashboard.
Figure 2: Spatial Points of Stop and Frisk Data (2017-2018) with Plotly and Mapbox
This is a basic interactive scatterplot of 2017-2018 Period data created on Plotly and Mapbox. We can see that spatial points are dispersed across all of New York City boroughs. While this scatterplot allows us to see where Stop, Question, and Frisk practices are happening, there is no way to be certain which areas have higher rate or density of occurance. In order to resolve the issue, we created a screengrid interactive that you can zoom in and out, tilt, and move around.
| Control Options | Description |
|---|---|
| Moving | Arrow Keys |
| Zoom | Scroll Wheels or Trackpad Zoom |
| Tilt/Orientation | Control (Command on Mac) + Shift + Arrow Keys |
It is also important to mention that the density is expressed by color. From the map, we can see that there are significantly more density in northern parts of and above Manhattan, such as East Harlem, Fordham, and Bronx. Since East Harlem and Fordham have significant minority population, it would be interesting to see whether race has an influence over decision-making of NYPD’s Stop, Question, and Frisk practice.
You can also conduct further interaction with more data points and options with a dashboard that we created on the top. You can also access the dashboard at https://samuellee19.shinyapps.io/SFQ_Final_Project/. (NOTE: some features could take some time to render due to size of our data)
To begin, we wanted to analyze whether stop and frisk actually practices seem to have any impact on the crime rate seen in an area. Therefore, using the data described above, we decided to visualize the effect of the stop and frisks on the crime rate of an area, by precinct. To do this, we first normalized the number of stops per precinct over a year. We then found the rate of change in the amount of crime per precinct for each year we had data on. Finally, we graphed this information, as shown below.


As we can see, there isn’t always a clear pattern in a relationship between the change of the number of crime and the scaled amount of stop and frisks seen in a precinct. If the stop and frisks did have an effect on crime, we would have expected there to be a strongly negative correlation. However, there doesn’t seem to be any relationship between these two values. When we seperate the values by year, we can also see sometimes there were some negative trends, which means the crime decreased as there were more stop and frisks, but also have some years with a positive trend meaning that areas with more stop and frisk also had an increase in crime. Therefore, there doesn’t seem to be a clear connection between the number of stop and frisks and the change in crime of an area. Therefore, this suggests that stop and frisks didn’t seem to serve the purpose they set out accomplish: the data doesn’t suggest clear alleviations in the cime rate.
Because the stop and frisks didn’t seem to be helping the crime rate in an area, we next wanted to figure out exactly how these stop and frisks were being conducted: what factors lead to someone being more likely to be chosen for a stop and frisk?
The first factor we analayzed which may lead to there being more stop and frisks in an area is how much crime happens there. We’d expect areas to have more crimes overall to also have a higher rate of stop and frisks, because the purpose of the stop and frisks would be to try and target these high crime areas. We therefore worked on visualizing a relationship between the amount of crime in an area and the amount of stop and frisks in an area, by year. We did this by standardizing the amount of crime per precinct by year, and then standardizing the amount of stop and frisks conducted per precinct by year. We then plotted these values, as shown below.


Through these graphs, we can see again there is no trend in values or clear positive trend, in fact there is a slightly negative trend. This means that even though areas had a high amount of crime, there wasn’t a corresponding high amount of stop and frisks happening in that area. One may think this is due to the stop and frisks decreasing the amount of crime in that year, but as we saw in the above analysis there was no relationship between these two variables. Therefore, we can see that stop and frisks don’t affect the crime rate, and it’s also not based on where there is more crime. Therefore, there must be other factors in play, in determining which individuals were more likely to be stopped and frisked.
Eterno, John, and Eli Silverman. 2013. “Mike Bloomberg’s Fact-Free Defence of Stop-and-Frisk.” The Guardian, September. https://www.theguardian.com/commentisfree/2013/sep/11/stop-and-frisk-michael-bloomberg.